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1.  Introduction 


We  consider  the  general  linear  model 

0-n  Yi  "  Ti  +  ^v0)^-  Ti  =  S  p<>  n  =  1 _ n)- 

where  is  an  unknown  (p  x  l)  vector,  the  (p  ■  11  vector*-  c!  are  known,  the 
error  terms  are  indepeiuh  n  t  and  identically  distributed  (i.i.d.)  with  common 
distri  ution  function  I  ,  and  o(i  ,0)  expresses  the  possible  heteroscedastici ty 
in  the  model,  with 

(1.2)  =  1  ♦  r'  a(r)  +  0(0)  as  0*0  . 


Bickel  (1978),  generalizing  work  of  Anscombe  (1901),  defines  robust  tests  for 
heteroscedast icity ,  which  in  the  present  context  is  a  test  of  1 10 :  0  -  0;  the  idea 
is  to  replace  aspects  of  the  usual  informal  examination  of  residuals  by  formal 
statistical  inference  about  1  he  probability  structure  of  the  data.  If  it.}  are 
the  fitted  values  (from  least  squares  or  possibly  a  robust  regression  method 
(Huber  (1973) (1977)))  jnd  b  is  an  even  function,  Bickel's  robust  test  statistic  is 


(1.3) 


\  =  ^  (a(t.)  -  ajftnbfr.)/^  , 


where 


and  for  any  function  g. 


2 


Bickel  makes  the  following  assumption : 

(1.6)  h  is  bounded  and  lias  two  continuous,  hounded  derivatives. 

Under  (1.6)  and  other  assumptions  (see  Theorem  1  below),  Rickel  obtains  the 

asymptotic  distribution  of  under  li^:  h  -  0  and  contiguous  alternatives, 

-> 

results  are  obtained  for  the  case  p^/n  -»  0  . 

One  of  the  most  attractive  choices  of  b  (well  motivated  in  Bickel's  Section 
3)  is  Huber’s  function  squared: 

(1.7)  Mx)  -  x2  | x |  _<  k 

=  k2  h!  >  »  . 

This  choice  of  h  does  not  satisfy  (1.6)  *o  that  Rickel’s  Theorem  3.1  does  not 
apply.  He  states  that  the  strong  smoothness  condition  (1.6)  is  "unsatisfactory" 
and  obtains  results  lor  (1.7)  only  when  p  is  bounded  and  fitting  is  by  least 
squares . 

In  this  note  we  show  by  a  simple  modification  of  Bickel’s  proofs  (using 
techniques  of  Carroll  (19781),  that  results  for  Vj  ran  he  "brained  for  b  given 
by  (1.7)  even  when  p  /n  -  0  and  fitting  is  by  robust  estimates  on  least  squares. 
This  result  is  given  in  .Section  2.  In  Section  3  we  note  extensions  *hich  obtain 
scale  invariance  by  robust  estimation  with  scale  estimated  by  Huber’s  Proposal  2. 

2 .  Mai n  Kesul t s 

Where  possible  we  adopt  Bickel’s  notation.  Without  loss  of  generality  w-' 

assume  n  '  c!  c.  =  I.  i>  provide  a  frame  of  reference  wc  state: 

—l  -l  1 

Theorem  !  (Bickel  (1978))  Suppose  the  following  hold: 

it.  |  <  M  . 


’’2.1) 


m;i  x 
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(2.2) 

(2.3) 

(2.4) 

(2.5) 


(2.6) 

(2.7) 

(2.8) 
(7.9) 
(2. IP) 

(2.11) 

Then 


n  1  )'(a(i  .  )  -  a  ( i ) )  “  M~  1  0  , 


18  n  ’  I  •'  M . 


1  is  symmetric  about  zero. 


M  *  <  J0(i:)  <  M  where  for  C  *  f(ahsolutely  continuous)  , 


J,(l)  =  /  lx  f (x)/f(x)  ♦  1)“  f(x)dx, 


Var(b(c ))  >  M'1  ^  0  , 


The  function  a  is  twice  bounde.Uy  and  continuously  differentiable. 


If  =  li  -  V  1  di  =  (Vp)* 


b (x)  =  b(-x)  . 


b  is  hounded  and  satisfies  (1.6), 


|>Vn  0  . 


(2.12)  P0  (A^  a  z\  =  1  -  Hz  -  A,,)  ♦  o(l)  , 

where 

?  ’i  _  t 

(213)  Ajj(0  ,  n)  =  0  (  l  (aftp  -  a  _(!))“]  M'  f-jb'  (tj)  (Var(b  (e. ) )  1 "  1  . 

i  *1 

''ur  generalization  of  theorem  1  to  incorporate  such  functions  s  r\.7)  is 


Theorem  2.  Suppose  (2.1)-(2.9)  and  the  following,  hold: 

(2.14)  b  is  bounded,  l.ipschitz  of  order  one,  and  has  two  bounded  continuous 
derivatives  except  possibly  it  a  finite  number  of  points,  which  we 


take  as  ic. 
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(2.15)  p4/n  ►  0  . 

Then  (2.12)  holds.  lAssumptiou  (2.8)  is  Jiscussed  in  the  next  section.; 

Proof  of  Theorem  2.  Ine  key  results  in  Ridel's  proof  arc  (A54)-(.\t7)  with 

w .  =  1  -  1/n  ( i  -  j ) 

=  -1/n  ( i  /  j )  . 

Because  b  is  bounded  and  Lipschits  of  order  one,  (A54)-(A36)  follow  exactly 
as  given  by  Pickel.  lie  uses  (A37)  to  prove 

.i.  " 

(2.16)  n  J  )  (a ( t . )  -  a  (t) )h(r  ) 

id 

n 

=  n  *  }  (n(  i  )  -  a  ( i  ) ) b ( r . ) 

i-1  1 

" 

♦  n  Mi  b'(.  )  l  (aft. )  -  a  (t))d.  ♦  o  (1)  , 

1  i=l  1  ‘  1  P 

where  d.  t^  •  t..  Instead  of  proving  (A.V’j  we  will  prove  (2  16)  directly. 

As  seen  in  Rickcl's  (AH)-(A47),  (2.16)  is  verified  by  proving  either  (A4S)  (as 
Bickcl  has  done)  or 

i  i  n  n  p 

(2.17)  n'JAa  =  n2  '[  j  w  a(T. )  Vb(r  )  -  b(c  )  ♦  d  b'  (E  ))  l  0  . 

i«l  j-1  J  J  J  J  J 

Re  will  prove  (2.17).  Note  that  in  Hhckcl's  proofs  of  (A41)-(A47)  the  nssu.nptic 

(J .6'  is  not  needed;  the  weaker  assumption  (2.14)  suffices.  Since  r.  =  >■ .  -  d., 

J  J  J 

with  T  being  the  indicatoi  function,  rewrite 

(2.18)  A  =  v.  7  w  .  a  ( i .  )  (b  ( r  -cl . )  •  b  ( i  .  )  ♦  d.  b'(r.)) 

“  1  J  1 J  1  J  J  J  J  J 

*  f  1  (-c  •  a  f  .  c  -  a  )  ♦  I  ( e  -  .i  •  i  .  c  ♦  a  ) 

n  j  n  n  i  -  n 

♦  I(-c  -  a  s»  •'  -c  ♦  a  )  ♦  I  (r .  •  e  ♦  a  )  ♦  Ifr.  <  -c  -  a  )  } 

n  —  j  n  i  -  n  j  —  n 

s  A  ,  ♦  A  _  4  A  v  ♦  \ 

ill  n2  n 3  lit  nr>  ’ 


where  a  *  0  will  be  specified  later.  We  a  1  so  write  A  =  K..(n).  W»  can 

n  n  i  j  ij 

further  write 


(2.19)  A  =  Z.  T..  K.  .  (n) 
nl  l  j  i  j 


I  ( -  e  <  r  .  -  d  .  •  e 


j  j  K  era  . •  c-a  )  .  A(J5  * 

r .-d. I  ,  c)  (  n  j  n  nl  n2 


As  in  Bickel's  (A48) ,  since  b  is  differentiable  mi  (-c,c), 

Ia()}  |  =  t1  (r.  d2)  =  o  (p)  . 

nl  1  p  i  p  *  ’ 

Note  that  if  -c  ♦  a  >  t  .  <  c  -  a  and  h  .  -  d . I  c  then  id. I  a  .  rhea,  bv  (-• 
ri  j  n  1  j  i  [  1  j  1  n  1 

since  b  is  Lipsehitz  aad  w,  ^  -  1/n, 

j,  '  b(,j)  '  h  t>'1  H  * », '  ‘-j  < '  -  vh-V  >« 


1  %  [  |d,l  MM, 


u  }  ■  M ,  ()  <rV-  (V  r  { i  d  .  i  -  a 
n  --  -  i=l  ’  j.l  J  11 


')  *  =  C  (p/.i  )  . 

p  *  n 


Thus  i A  n  l  !  =  CYfp/an)‘  S‘nll,irly’  »  Xn4  1  "  l,p(l,/:ln>  •  I \l5  i  ’  ^p(p/''n} 
1  urthcr,  by  (2.1)  and  since  b  is  Lipsehitz, 


(2.20) 


I An_> I  M,  C  iij) '  (>:.  He 


a  •  r  .  <  c  ♦  a  ) )  2 
n  —  j  —  n 


A  simi 1  r  bound  holds  fur  ;A  ^  |  .  Since  by  (2.S)  1  is  l  ipsehitz  in  neigh  lorhoods 


of  *c,  Lcmr.ia  1  of  Carroll  (1978)  shows 


(2.71)  7  I(c  -  a  v  t  .  o  ♦  a  j  =  0  (max (log  n,  n(F(c  ♦  a  )  -  F(c  -  a  ))) 

,  i  n  —  j  —  n  p  n  n 


Provided  na  _>  log  n,  (2.70)  and  (2.21)  rive 


fA„2  ‘  =  l,p{("  I’an)i)  |Anll  (1P(fn  ran),)  ' 


This  yields 


(2.22) 


n  \,  1  (p/an)) 


If  we  toKc  a^  =  ii  J,  (2,lr.)  and  (2.27)  vieli 
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n''1  A  =  r  ( n  , 
n  p 

completing  the  proof.  [') 

3 •  Extensions 
A .  Scale  invariance 

The  test  statistic  \  is  not  scale  invariant  To  obtain  such  invariance, 
one  would  rewrite  the  model  (1.1)-(1.2)  so  that  (t,d)  =  (1  ♦  aft)')  p  (0)) /u^ , 
where  Og  is  a  scale  parameter  consistently  estimated  fwhen  0  =  0)  by  a 
estimate  a  (provided  by  least  squares  or  Huber's  Proposal  L  for  robust  regression 
To  obtain  scale  invariance,  Hick  el  suggests  replacing  b(r.)  by  b(r^/o) .  The 
statements  and  proofs  of  Theorems  1  and  2  must  bo  modified  for  this  new  test 
statistic  which  we  denote  A^(a).  An  analogue  of  Theorem  2  is 

Theorem  3.  Suppose  the  conditions  of  Theorem  2  hold  and,  an  addition, 

(3.1)  n*  (S  -  o0)  *  0  (1)  . 

(3.2)  P. fb*  (cj)f  t  }2  *  «  . 

(3.3)  P  (b"  Ct  *  «  • 

Then  (2.12)  holds  for  A.(  ) . 

Ren .iik.  Assumption  (.7.1)  i  the  subject  of  part  B  of  this  section.  Assumptions 
(3.*)  and  (3.3)  hold  if  b  is  constant  outside  an  interval  (as  is  (1.7)). 

Cketch  of  the  proof  ol'  Theorem  3.  We  need  to  verify  substitutes  for  Bickel's 

(A3S)  and  (A37)  when  bfr^)  is  replaced  bv  b(r^/o)  ,  b(r-)  is  replaced  by  bie^/c^) 

1 

a. id  the  remainder  terns  are  (respectively)  0  ( (up)  M  and  t^(p).  do  prove  the 


7 


substitute  for  (A35)  one  mist  show  that 


(3.4)  L  w  b(r./d)  b(r./3)  -  Z  w.  .  b(t  ./a)  b(  /■■ ,  =  0  ( fro)  -) 
ij  l  j  ij  i  j  p  1 


( 3 .  S ) 


I  w.  .  h(e  /o)  b(  ,/o)  -  l  w  b(  ./. •  l  !> ( *  =  0  ( (np)  ) 

ij  l  J  ij  l  i>  i  "  p 


Csing  the  special  form  of  w^,  (3.4)  follows  from  (2.8)  ant!  the  fact  that  b  is 
h  unded  an.!  Lipsciiitz;  (3.5)  is  a  consequence  of  (3.1)  and  (3.2).  Statement  (A  7  7) 
is  more  complex.  The  analogue  of  (A42)-(A45)  is  to  show 

l  w  (a(ri)  -  a(t.))  b(»  ./n)  =  0  (p)  , 

i,j  J  ‘ 

for  which  (using  nickel's  proof)  it  suffices  to  show 


r.M 


l  wii  J ' 11 1 )  di(bfci/^)  -  b0  /  ,))  =  0n<P) 

i ,  i  J  J  J 


fcie  lewritc  (3.6)  as 


(3.7)  l  w  a'(,.jd.  (t.  b '  (c./Oq)  -  i;  ,  b’(r1/a0)(l/o  -  1/a  ) 

i.j  J  J  J 

*  l  w  a'ft.JiT  (b(c./3)  -  bfe./cjy)  -  (1/a  -  1 / aQ ) ‘  ^  ^'(k./c^) 

i.j  1  J  J 

=  B  ♦  . 

In  2n 

Th  B  =  £>  (p)  follows  from  using  the  Schwarz  inequality,  the  boundedness  of 

J  Ti  p 

a,  and  then  applying  (3.1, and  (3.2).  That  =  0^(p)  is  complicated  notationally 
but  i  a  consequence  of  a  weakened  version  of  Lemma  2  of  Carroll  (1978).  ibis 
verifies  l3.6). 

The  analogue  of  (A43i  is  to  show  that 


(3.8) 


l  a(t.)(b(r./0)  -  b(i j/uy) )  -  0p(p)  . 

*  »  J 


c 


First  note  that  (3.1)  and  the  proof  or  Theorem  can  he  used  to  show  that 


(3.9) 


/  w.  .  a{  t . )  (b(r  /  )  l»(-  ./o)  «  (d  ')!>'  ft  ./<)))-  0  ( p)  • 

L  ■  1  1  1  I  J  i  j  p  1 


t  .) 


To  verify  (3.8)  we  must  show  that  the  difference  between  (3.S)  and  (3.91  is 
J  (pi ;  this  is  a  consequence  of  the  fullowir.: 


(3.10)  [  w  a(T.)(b(i  /o)  -  b(r  ./o0))  --  C  (p) 

i ,  j  J  J  p 

d 

(3.11)  l  w  oft.)  J-  (b'(e  /o)  -  b*(i  /o0))  =  0  (p) 


(3.12) 


/  w  aft  )  d.  b' (e./un) (1/d  -  1/u.l  -  C  (p)  . 
lj  i  1  J  0  0  p  1 

A  »  J 


Lquations  (3.11)  and  (3.12)  follow  by  applying  the  Schwarz  inequality,  (2.81, 
(2.14),  (3.1)  and  (3.3).  We  c..n  rewrite  (3.10)  as 

(313)  l  w  >i(x. )  (b(*.  ./•  )  -  b(c./0l  •  (1/e  -  l/o0)c  b'(r  /a  » 
i ,  j  J  J  J  J 

♦  w..  a(  t . )  [i. .  b '  (u  .  /  ’  )  -  It,  b*  (»../ct  )  1  f  1/3  -  I/O)  .  [*  ♦  B*„  , 

f  lj  i  1  t  ]0  1  10'  0  n 1  n  2 

l 


the 


last  step  following  since  Y  w.  a(t.)  -  0.  Tint  15*,  -  0  (p)  follows  us  in 
r  ij  t  nip' 


the  proof  of  Theorem  2,  while  B*_,  =  0  (p)  follows  from  (3.1)  and  the  Chehychev 
inequality.  [1 

8.  On  assumption  (2.8).  Huber's  Proposal  2  for  robust  regression  is  to  solve 

n 

(3.12)  V  •  (lY  -  c  tl)/M)c.  =  0 

i  1 

tt  ,  o 

(3.! 3)  )  ^((Y.  -  c.  f.)/o )  K  i/’fZ)  , 

i  ='  1  1  1 


th'-  last  expectation  tal.tn  in .der  the  standard  normal  distribution  function.  Huber 
(1973)  shows  (2.8)  under  t  h  following  conditions.: 


0 


f ^ .  14)  !{'  is  odd  and  non -decreasing, 

\ji  has  two  bounded  continuous  derivatives, 
o  =  1  and  onl>  (3,12)  is  solved, 

yp  ►  0,  where  y  is  the  maxii  an  diagonal  element  of  t’'s  projection 

-  )  2 
matrix  C(C’C)  C'  =  CC  T  (since  y'-'p/n,  p  /n  ►  ,) 

is  necessary.) 

The  above  conditions  are  very  restrictive  in  assuming  >,  known,  and  Huber’s  f unc : i 

(;»(x)  =  max(-k,  min(k,x)))  does  not  satisfy  the  smoothness  condition. 

Carre  11  and  Ruppert  (1979)  have  generalized  Huber’s  result  by  means  of  the 

techniques  used  in  the  proot  of  Theorem  2.  They  utilize  the  full  system 

(3 . 1 2) - (3 . 1 3)  and  assume  only  that  <1<  satisfies  (2.14),  which  is  true  for  all 

functions  used  in  practice.  The  price  paid  is  a  stronger  condition  on  the  growth 

rate  of  p;  they  require  th.t  for  some  sequence  a^  ►  0,  both  yp/a  ►  0  and 

nya  -  0.  When  the  design  is  balanced  (  y  =  p/’n),  then  =  p  ^  for  any 

•U2c 

c  s  o  suffices,  but  that  requires  p  /n  ►  0. 

C.  Smoothness  of  F.  Condition  (2.5)  is  rather  .trong.  Ruppert  and  Carroll  (1979 
show  by  entirely  different  methods  that  when  p  is  fixed  ana  b  satisfies  (2.14),  (2.'>) 
can  be  relaxed  by  requiring  only  that  1  is  l.ipsch  z  of  order  one  in  neighbo  hr  -J  . 


of  -*c. 


1ft 


Wet oreuecs 

Ansco.tibe,  I.. I.  (19M).  I:\nininat  ion  of  residuals.  i'roa.  Fourth  bark*  leu  Syirp. 
Hath,  Statin t.  i'o.  >ia  1  -  3<> 

Rickel,  P..J.  (1978).  Using  residuals  robustly  i:  tests  for  hetcroscedasticity , 
non-linearity.  Ann.  Statist.  6,  266 - 2ft 1. 

Carroll,  R.J.  (1978).  On  ilmost  'hit  expansions  for  M-est i mates.  Ann.  Statist. 
(>,  514-318. 

Huber,  P.J.  (1973).  Robust  regression:  as  vnipt  ot  1 1  s  ,  conjectures  and  Vonte-C  ir  1  j  . 

dnu.  Statist.  1,  '’99-821. 

Huber,  (1977).  Robust  Statistical  Procedures.  SIAM,  Philadelphia. 


